GSM speech coding and speaker recognition

Authors

  • Laurent Besacier
  • Sara Grassi
  • Alain Dufaux
  • Michael Ansorge
  • Fausto Pellandini
Abstract

This paper investigates the influence of GSM speech coding on text-independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, yielding three transcoded databases. In a first experiment, it was found that GSM coding significantly degrades identification and verification performance, and that the degradation tracks the perceptual speech quality of each coder. In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that the low LPC order used in GSM coding is responsible for most of the performance degradation. By extracting the features directly from the encoded bit stream, we also obtained a speaker recognition system equivalent in performance to the original one, which decodes and reanalyzes speech before performing recognition.
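As a rough illustration of what computing features from the encoded bit stream can look like, the sketch below converts a frame's LPC predictor coefficients into LPC-derived cepstral coefficients using the standard recursion. It assumes the bit stream has already been unpacked into predictor coefficients a_1..a_p for the all-pole model 1 / (1 - sum a_k z^-k). The function name `lpc_to_cepstrum` and the choice of cepstral features are assumptions made for illustration, not something the abstract specifies.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC predictor coefficients to cepstral coefficients.

    a      : list [a_1, ..., a_p] for the model 1 / (1 - sum_k a_k z^-k)
    n_ceps : number of cepstral coefficients c_1..c_n to return
    (Hypothetical helper; the gain term c_0 is ignored.)
    """
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] is unused
    for n in range(1, n_ceps + 1):
        # Direct term a_n exists only while n does not exceed the LPC order.
        acc = a[n - 1] if n <= p else 0.0
        # Recursive term: sum over k of (k / n) * c_k * a_(n-k), with n-k <= p.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]


if __name__ == "__main__":
    # Toy single-pole frame; real coders carry 8 to 10 coefficients per frame.
    print(lpc_to_cepstrum([0.5], 3))
```

With a low LPC order p, every cepstral coefficient beyond c_p is fully determined by the first p values, which gives some intuition for why a low-order spectral envelope limits the speaker information available to the recognizer.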

Similar resources

Speaker recognition from coded speech in matched and mismatched conditions

We investigate the effect of speech coding on automatic speaker recognition when training and testing conditions are matched and mismatched. Experiments use standard speech coding algorithms (GSM, G.729, G.723, MELP) and a speaker recognition system based on Gaussian mixture models adapted from a universal background model. There is little loss in recognition performance for toll quality speech...

Speaker verification with GSM coded telephone speech

In this paper we investigate the impact on the performance of Speaker Verification (SV) systems of the signal and channel coding in GSM cellular telephone networks. In this study only the effects of the codec are investigated. This is done by transcoding the signals in an existing speech corpus, recorded in the fixed network, to GSM. We compared text-dependent SV performance of systems trained with...

Overview of compression and packet loss effects in speech biometrics (IEE Proceedings - Vision, Image and Signal Processing)

An overview is presented of compression and packet loss effects in speech biometrics. These new problems appear particularly in recent applications of biometrics over mobile or Internet networks. The influence of speech compression on speaker recognition performance in mobile networks is investigated. In a first experiment, it is found that the use of GSM coding degrades the performance. In a s...

Speaker recognition on lossy compressed speech using the Speex codec

This paper examines the impact of lossy speech coding with Speex on GMM-UBM speaker recognition (SR). Audio from 120 speakers was compressed with Speex into twelve data sets, each with a different level of compression quality from 0 (most compressed) to 10 (least), plus uncompressed. Experiments looked at performance under matched and mismatched compression conditions, using models conditioned ...

LOW-COMPLEXITY AUTOMATIC SPEAKER RECOGNITION IN THE COMPRESSED GSM AMR DOMAIN (WedAmOR2)

This paper presents an experimental implementation of a low-complexity speaker recognition algorithm working in the compressed speech domain. The goal is to perform speaker modeling and identification without the need to decode the speech bitstream to extract speaker-dependent features, thus saving important system resources, for instance, in battery-powered mobile DSP devices. Bitstream valu...

Publication date: 2000